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Abstract 



o . 

■ We study the problem of polynomial identity testing (PIT) for depth 2 arithmetic circuits 

over matrix algebra. We show that identity testing of depth 3 (SnE) arithmetic circuits over a 
field F is polynomial time equivalent to identity testing of depth 2 (JIT,) arithmetic circuits over 
U2(F), the algebra of upper-triangular 2x2 matrices with entries from F. Such a connection is 
a bit surprising since we also show that, as computational models, HI] circuits over U2(F) are 
strictly 'weaker' than SIIS circuits over F. 

The equivalence further shows that PIT of depth 3 arithmetic circuits reduces to PIT of 
width-2 planar commutative Algebraic Branching Programs{ABP). Thus, identity testing for 
i commutative ABPs is interesting even in the case of width-2. 

I Further, we give a deterministic polynomial time identity testing algorithm for a HE circuit 

• . over any constant dimensional commutative algebra over F. While over commutative algebras 
Q I of polynomial dimension, identity testing is at least as hard as that of SHS circuits over F. 

^ , 1 Introduction 
00 . 

lO . Polynomial identity testing (PIT) is a fundamental problem in theoretical computer science. Over 

I the last decade this problem has drawn significant attention from many leading researchers owing 

• ■ to its role in designing efficient algorithms and in proving circuit lower bounds. Identity testing is 
, the following problem: 

0\ . 

, Problem 1.1. Given an arithmetic circuit C with input variables xi, . . . ,Xn and constants taken 

from a field ¥, check if the polynomial computed by C is identically zero. 

' I 

^ ■ Besides being a natural problem in algebraic computation, identity testing appears in important 

^ ! complexity theory results such as, IP = PSPACE jShaQO] and the PCP theorem [ALM+98] . It also 

plays a promising role in proving super-polynomial circuit lower bound for permanent [KI03ilAgr05|. 
Moreover, algorithms for problems like primality testing |AKS04] . graph matching [Lov79j and 
multivariate polynomial interpolation [CDGKOT] also involve identity testing. 

The first randomized polynomial time algorithm for identity testing was given by Schwartz and 
Zippel |Sch80l |Zip79| . Several other efficient randomized algorithms |CK97l ILVMI IAB991 IKSOT] 
came up subsequently, resulting in a significant improvement in the number of random bits used. 
However, despite many attempts a deterministic polynomial time algorithm has remained elusive. 
Nevertheless, important progress has been made both in the designing of deterministic algorithms 
for special circuits, and in the understanding of why a general deterministic solution could be hard 
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to get. 



Without loss of generality, we can assume that a circuit C has alternate layers of addition and 
multiplication gates. A layer of addition gates is denoted by S and that of multiplication gates 
is denoted by 11. Kayal and Saxena |KS07j gave a deterministic polynomial time identity testing 
algorithm for depth 3 (SIIE) circuits with constant top fan-in. As such, no other general poly- 
nomial time result is known for depth 3 circuits. A justification behind the hardness of PIT even 
for small depth circuits was provided recently by Agrawal and Vinay [AV08j . They showed that a 
deterministic black box identity test for depth 4 (SllSn) circuits would imply a quasi-polynomial 
time deterministic PIT algorithm for any circuit computing a polynomial of low degreqj. 

Thus we see that the non-trivial case for identity testing starts with depth 3 circuits; whereas 
circuits of depth 4 are almost the general case. It is therefore natural to ask as to what is the 
complexity of the PITproblem for depth 2 (HE) circuits if we allow the constants of the circuit to 
come from an algebra n T^, which is not a field, and has dimension over F, dimp (7^) > 1. We can 
make the reasonable assumption that the algebra TZ is given in basis form i.e. we know an F-basis 
{ei, . . . , Ck} of TZ and we also know how eiCj can be expressed in terms of the basis elements, for 
all i and j. Therefore, the problem at hand is the following, 

Problem 1.2. Given an expression, 

d 

P = Yl (^»o + Aixi + ... + A 

in^n ) 

1=1 

where Aij G TZ, an algebra over F given in basis form, check if P is zero. 

How hard is the above problem? At first sight, this problem might look deceptively simple. For 
instance, if TZ is field or even a division algebra (say, the real quaternion algebra) then it is trivial 
to check if P = using polynomial number of F-operations. However, in general this is far from 
what might be the case. 

Since elements of a finite dimensional algebra, given in basis form, can be expressed as matrices 
over F we can equivalently write the above problem as. 

Problem 1.3. Given an expression, 

d 

P = W (^*0 + AiiXi + ...+ AinXn) (1) 
i=l 

where Aij € Mfc(F), the algebra of k x k matrices over¥, check if P is zero using poly {knd) number 
of ¥ -operations. 

In order to avoid confusion we would use the following convention in this paper: 

Convention - Whenever we say 'arithmetic circuit' or 'arithmetic formula' without any extra 
qualification, we would mean a circuit or a formula over a field. Otherwise, we would explicitly 
mention 'arithmetic circuit (or formula) over some algebra' to mean that the constants of the circuit 
are taken from 'that' algebra. 

polynomial is said to have low degree if its degree is less than the size of the circuit 
^In this paper we always mean a finite dimensional associative algebra with unity. 
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1.1 The depth 2 model of computation 

A depth 2 circuit C over matrices, as in Equation [H naturally defines a computational model. 
Assuming TZ = Mfc(F), for some k, a polynomial P G TZlxi, . . . outputted by C can be viewed 
as 8l k X k matrix of polynomials in F[xi, . . . , 1. We say that a polynomial / G . . . , x^] is 
computed by C if one of the k^ polynomials in P is /. Sometimes we would abuse terminology a 
bit and say that P computes f to mean the same. 

In the following discussion, we would denote the algebra of upper-triangular k x k matrices by 
Ufc(F). The algebra U2(F) is the smallest noncommutative algebra with unity over F, in the sense 
that dimp U2(F) = 3 whereas any algebra with unity of dimension less than 3 is commutative. We 
show in this paper that already U2(F) captures an open case of identity testing. 

Ben-Or and Cleve [BC88I showed that a polynomial computed by an arithmetic formula E of 
depth d, and fan-in (of every gate) bounded by 2, can also be computed by a straight-line program 
of length at most 4'^ using only 3 registers. The following fact can be readily derived from their 
result (see Theorem lA.ip : From an arithmetic formula E of depth d and fan- in bounded by 2, we 
can efficiently compute the expression, 

m 

P = Yl i^O + AiiXi + ...+ AinXn) 
i=l 

where m < 4*^ and Aij G M3(F) such that P computes the polynomial that E does. Thus solving 
Problem 11.31 in polynomial time even for 3x3 matrices yields a polynomial time algorithm for PIT 
of constant depth circuits, in particular depth 4 circuits. There is another way of arguing that the 
choice of TZ as M3(F) is almost the general case. 

For an arithmetic circuit of size s, computing a low degree polynomial, we can use the depth- 
reduction result by Allender, Jiao, Mahajan and Vinay |AJMV98j (see also |VSBR83] ) to get an 
equivalent bounded fan-in formula of size s'^('°s*) and depth O(log^s). From this formula we can 
obtain a depth 2 circuit over M3(F) of size 4*^^'°^ = s'^('°s*) (using Ben-Or and Cleve's result) 
that computes the same polynomial as the formula. Thus, PIT for depth 2 circuits over 3x3 
matrices is almost the general case since a derandomization yields a quasi-polynomial time PIT 
algorithm for any circuit computing a low degree polynomial. This means, in essence a depth 2 
circuit over M3(F) plays the role of a depth 4 circuit over F (using Agrawal and Vinay's result). 

What is natural to ask is how the complexity of PIT for depth 2 circuits over M2(F) relates 
to PIT for arithmetic circuits. In this paper, we provide an answer to this. We show a surprising 
connection between PIT of depth 2 circuits over U2(F) and PIT of depth 3 circuits. The reason this 
is a bit surprising is because we also show that, a depth 2 circuit over U2(F) is not even powerful 
enough to compute a simple polynomial like, xiX2 + X3X4 + x^xqI 

Known related models 

Polynomial identity testing and circuit lower bounds have been studied for different algebraic 
models. Nisan ^Nis91) showed an exponential lower bound on the size of any arithmetic formula 
computing the determinant of a matrix in the non-commutative free algebra model. The result 
was later generalized by Chien and Sinclair |CS04] to a large class of non-commutative algebras 
satisfying polynomial identities, called Pl-algebras. Identity testing has also been studied for the 
non-commutative model by Raz and Shpilka [RS04j . Bogdanov and Wee |BW05] . and Arvind, 
Mukhopadhyay and Srinivasan [AMS08j . But unlike this model where the variables do not com- 
mute, in our setting the variables always commute but the constants are taken from an algebra TZ. 
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The motivation for studying this later model is not only because it is a natural generalization of 
commutative circuits over fields but also because it gives a different perspective to the complexity 
of the classical PIT problem in terms of the dimension of the underlying algebra TZ. It seems to 
'pack' the combinatorial nature of the circuit into a larger base algebra and hence opens up the 
possibility of using algebra structure results. The simplest nontrivial circuit in this model is a depth 
2 circuit over the smallest non-commutative algebra which is 7^ = U2(F). 

1.2 Our Results 

The results we give are of two types. Some are related to identity testing while the rest are related 
to the weakness of the depth 2 computational model over U2(F) and M2(F). 

Identity testing 

We fill in the missing information about the complexity of identity testing for depth 2 circuits over 
2x2 matrices by showing the following result. 

Theorem 1.4. Identity testing for depth 2 (TIT,) circuits over U2(F) is polynomial time equivalent 
to identity testing for depth 3 fSlIS ) circuits. 

The above result has an interesting consequence on identity testing for Algebraic Branching Pro- 
gram (ABP) |Nis91j (see Definition I2.2p . It is known that identity testing for non-commutative 
ABP can be done in deterministic polynomial time (a result due to Raz and Shpilka [RS04] ) . But no 
interesting result is known for identity testing of even width-2 commutative ABP's. The following 
result explains why this is the case. 

Corollary 1.5. Identity testing of depth 3 circuits reduces to identity testing of width-2 planar 
ABPs. 

Further, we give a deterministic polynomial time identity testing algorithm for depth 2 circuits 
over any constant dimensional commutative algebra given in basis form. Recall that an algebra 
TZ is given in basis form if we know an F-basis {ei, . . . , e^} of TZ and we also know how ejCj can 
be expressed in terms of the basis elements, for all i and j. Our result can be formally stated as 
follows. 

Theorem 1.6. Given an expression, 

d 

P = W {Aio + Aiixi + ... + A 

in^n ) 

i=l 

where Aij £ TZ, a commutative algebra of constant dimension over ¥ that is given in basis form, 
there is a deterministic polynomial time algorithm to test if P is zero. 

In a way, this result establishes the fact that the power of depth 2 (IIS) circuits over constant di- 
mensional algebras is primarily derived from the non-commutative nature of the algebra. However, 
things can be very different for commutative algebras of polynomial dimension over F. 

Theorem 1.7. Identity testing of a depth 3 (SIIS) circuit C reduces to identity testing of a depth 
2 (nS) circuit over a commutative algebra of dimension polynomial in the size of C. 
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It would be apparent from the proof of Theorem 11.41 that our argument is simple in nature. 
Perhaps the reason why such a connection was overlooked before is that, unlike a depth 2 circuit 
over M3(F), we do not always have the privilege of exactly computing a polynomial over F using a 
depth 2 circuit over U2(F). Showing this weakness of the latter computational model constitutes 
the other part of our results. 

Weakness of the depth 2 model over U2(F) and M2(F) 

Although Theorem 11.41 shows an equivalence of depth 3 circuits and depth 2 circuits over U2(F) with 
respect to PIT, the computational powers of these two models are very different. The following 
result shows that a depth 2 circuit over U2(F) is computationally strictly weaker than depth 3 
circuits. 

Theorem 1.8. Let f e F[ xi, . . . ,Xn] be a polynomial such that there are no two linear functions 
li and I2 (with 1 ^ (^1,^2); the ideal generated by li and I2) which make f mod {hjh) also a linear 
function. Then f is not computable by a depth 2 circuit over U2(F). 

It can be shown that even a simple polynomial like X1X2 + 3^3X4 + x^xq satisfies the condition 
stated in the above theorem (see Corollary 14. ip , and hence it is not computable by any depth 2 
circuit over U2(F), no matter how large! This contrast makes Theorem 11.41 surprising as it estab- 
lishes an equivalence of identity testing in two models of different computational strengths. 

At this point, it is natural to investigate the computational power of depth 2 circuits if we 
graduate from U2(F) to M2(F). The following result hints that even such a model is severely 
restrictive in nature. 

A depth 2 circuit over l\/l2(F) gives as output a polynomial P = HiLi (^«o + Anxi + . . . + AinXn), 
with Aij G M2(F). Let denote the partial product P^ = ni=£ i^io + ^n^i + • • • + AinXn) where 

Definition 1.9. A polynomial f S F[xi, . . . is computed by a depth 2 circuit over M2(F) under 
a degree restriction of m if the degree of each of the partial products Pi is bounded by m. 

Theorem 1.10. There exists a class of polynomials of degree n that cannot be computed by a depth 
2 circuit over M2(F), under a degree restriction of n. 

The motivation for imposing a condition like degree restriction comes very naturally from depth 
2 circuits over M3(F). Given a polynomial / = m-j, where mj's are the monomials of /, it is easy 
to construct a depth 2 circuit over M3(F) that literally forms these monomials and adds then one by 
one. This computation is degree restricted, if we extend our definition of degree restriction to M3(F). 
However, the above theorem suggests that no such scheme to compute / would succeed over M2(F). 

Remark- By transferring the complexity of an arithmetic circuit from its depth to the dimension 
of the underlying algebras while fixing the depth to 2, our results provide some evidence that 
identity testing for depth 3 circuits appears to be mathematically more tractable than depth 4 
circuits. Besides, it might be possible to exploit the properties of these underlying algebras to say 
something useful about identity testing. A glimpse of this indeed appears in our identity testing 
algorithm over commutative algebras of constant dimension over F. 
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1.3 Organization 



The results on identity testing are given in sections [2] and [3l while those on the weakness of the 
depth 2 model are given in section SI In section [2] we prove the equivalence of identity testing 
between depth 3 circuits and depth 2 circuits over U2(F) (Theorem ll.4p . and show how it connects 
to width-2 ABPs (Corollarv ll.Sp . The deterministic polynomial time identity testing algorithm 
over commutative algebra of constant dimension is presented in section [3] (Theorem II. 6p . In the 
same section it is also shown that commutative algebras of polynomial dimensions are powerful 
enough to capture PIT of depth 3 (SIIS) circuits ( Theorem 1 1.7|) . Finally, in section H] we show the 
weakness of the depth 2 model over U2(F) and M2(F) (Theorem 11.81 and II. lOp . 



2 Identity testing over M2(F) 

In this section, we show that PIT of depth 2 circuits over M2(F) is at least as hard as PIT of depth 

3 circuits, and this further implies that PIT of a width-2 commutative ABP is also at least as hard 
as PIT of depth 3 circuits. 



2.1 Equivalence with depth 3 identity testing 



We will now prove Theorem ll.4l Given a depth 3 circuit we can assume, without loss of generality, 
that the fan-in of the multiplication gates are the same. This multiplicative fan-in will be referred 
to as the degree of the depth 3 circuit. The following lemma is the crux of our argument. For 
convenience, we will call a matrix with linear functions as entries, a linear matrix. 

Lemma 2.1. Let / G F[ xi, . . . ,Xn] be a polynomial computed by a depth 3 circuit C of degree d 
and top level fan-in s. Given circuit C , it is possible to construct in polynomial time a depth 2 
circuit over U2(F) of size 0{{d + n)s'^) that computes a polynomial p = L ■ f , where L is a product 
of non-zero linear functions. 

Proof. A depth 2 circuit over U2(F) is simply a product sequence of 2 x 2 upper-triangular linear 
matrices. We now show that there exists such a sequence of length 0{{d + n)s^) such that the 
product 2x2 matrix has L ■ f as one of its entries. 

Since / is computed by a depth 3 circuit, we can write / = J2i=i^i^ where each summand 
Pi = Y\j hj is a product of linear functions. Observe that we can compute a single Pi using a 
product sequence of length d as: 



hi 




li2 




h{d-i) 




1 kd 




' L' Pi' 


1 




1 




1 




1 




1 



(2) 



where L' = hi ■ ■ ■ li(d-i)- 
Each matrix of the form 



1 I 
1 



where I = uq + be further expanded as, 



1 Oo 




1 aixi 




1 (XjiXyi 




' I r 


1 




1 




1 




1 



These will be the only type of non-diagonal matrices that would appear in the sequence. 
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The proof will proceed by induction where Equation [2] serves as the induction basis. A generic 

Li L2g 



intermediate matrix would look like 



where each Li is a product of non-zero linear 



functions and g is & partial summand of Pi's. We shall inductively double the number of summands 
in g at each step. 



At the i-th iteration let us assume that we have the matrices 



Li 



L2g 
L3 



and 



Ml M2h 
M3 



each computed by a sequence of rii linear matrices. We now want a sequence that computes a 



polynomial of the form L 


■{g + h). 


Consider the following sequence, 




Li L2g 




' A 




" Ml M2/1 ' 




" ALiMi ALiM2h + BL2M3g 




L3 




B 




M3 




BL3M3 



where A, B are products of linear functions. By setting A 
desired sequence, 



L2M3 and B 



(3) 

L1M2 we get the 



Li L25 




" A 




' Ml M2/1 ' 




" L1L2M1M3 LiL2M2M3{g + h) 


L3 . 




B 




M3 




L1L3M2M3 



This way, we have doubled the number of summands in g + h. The length of the sequence computing 
L2g and M2h is rij, hence each Lj and Mj is a product of rii many linear functions. Therefore, both 

r ^ 1 

A and B are products of at most 2nj linear functions and the matrix ^ can be written 

as a product of at most 2nj diagonal linear matrices. The total length of the sequence given in 
Equation [3] is hence bounded by Arii . 

The number of summands in / is s and the above process needs to be repeated at most log s + 1 
times. The final sequence length is hence bounded by {d + n) ■ 4'°^* = (c? + n)s^. □ 



It follows from Lemma l2. II that, given a depth 3 circuit C computing / we 

Li L-f 



Proof of Theorem [77 

can efficiently construct a depth 2 circuit over U2(F) that outputs a matrix, 



L is a product of non-zero linear functions. Multiplying this matrix by 



1 



, where 

^2 

to the left and 







to the right yields another depth 2 circuit D that outputs 







L-f 




. Thus D computes 



an identically zero polynomial over U2(F) if and only if C computes an identically zero polynomial. 
This shows that PIT for depth 3 circuits reduces to PIT of depth 2 circuits over U2(F). 

The other direction, that is PIT for depth 2 circuits over U2(F) reduces to PIT for depth 3 
circuits, is trivial to observe. The diagonal entries of the output 2x2 matrix is just a product 
of linear functions whereas the off-diagonal entry is a sum of at most d' many products of linear 
functions, where d' is the multiplicative fan-in of the depth 2 circuit over U2(F). □ 



2.2 Width-2 algebraic branching programs 



Algebraic Branching Program (ABP) is a model of computation defined by Nisan [Nis91] . Formally, 
an ABP is defined as follows. 



Definition 2.2. (Nisan \Nis91^ ) An algebraic branching program (ABP) is a directed acyclic graph 
with one source and one sink. The vertices of this graph are partitioned into levels labelled to d, 
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where edges may go from level i to level i + 1. The parameter d is called the degree of the ABP. The 
source is the only vertex at level and the sink is the only vertex at level d. Each edge is labelled 
with a homogeneous linear function of xi, . . . , Xn (i-e. a function of the form CiXi). The width 
of the ABP is the maximum number of vertices in any level, and the size is the total number of 
vertices. 

An ABP computes a function in the obvious way; sum over all paths from source to sink, the 
product of all linear functions by which the edges of the path are labelled. 

An ABP is said to be planar if the underlying graph is planar. 



The following argument shows how Corollary 11.51 follows easily from Theorem 11.41 
Proof of Corollary \1.5[ Theorem II .41 constructs a depth 2 circuit D that computes P 



n,(^.o+ 

Ajixi + ... + AjnXn), where each Aji £ U2(F). We can make D homogeneous by introducing 
an extra variable z, such that P = Y\j{AjQZ + AjiXi + . . . + AjnXn). This means, the product 
sequence considered in Lemma 12. H is such that all the linear matrices have homogeneous linear 

z err. 



functions as entries and the only non-diagonal linear matrices are of the form 



It is now 



straightforward to construct a width-2 ABP by making the j*'' linear matrix in the sequence act as 
the adjacency matrix between level j and j ' + 1 of the ABP. The ABP constructed is planar since 
it has layers only of the following two kinds: 




where li. I2 are homogeneous linear functions. 



□ 



As a matter of fact, the above argument actually shows that PIT of depth 2 circuits over M2(F) 
reduces to PIT of width-2 ABPs. 



3 Identity testing over commutative algebras 

We would now prove Theorem 11.61 The main idea behind this proof is a structure theorem for 
finite dimensional commutative algebras over a field. To state the theorem we need the following 
definition. 

Definition 3.1. A ring TZ is local if it has a unique maximal ideal. 

An element u in a ring TZ is said to be a unit if there exist an element u' such that uu' = 1, 
where 1 is the identity element of TZ. An element m £ TZis nilpotent if there exist a positive integer 
n with m" = 0. In a local ring the unique maximal ideal consists of all non-units in TZ. 

The following theorem shows how a commutative algebra decomposes into local sub-algebras. 
The theorem is quite well known in the theory of commutative algebras. But since we need an 
effective version of this theorem, we present the proof here for the sake of completion and clarity. 
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Theorem 3.2. A finite dimensional commutative algebra TZ over¥ is isomorphic to a direct product 
of local rings i.e. 

7^ ^ 7^l e . . . e 

where each TZi is a local ring contained in TZ and any non-unit in IZi is nilpotent. 

Proof. If all non-units in IZ are nilpotents then 7^ is a local ring and the set of nilpotents forms the 
unique maximal ideal. Therefore, suppose that there is a non-nilpotent zero-divisor z in TZ. (Any 
non-unit z in a finite dimensional algebra is a zero-divisor i.e. 3y TZ and y ^ such that yz = 0.) 
We would argue that using z we can find an idempotent v {0, 1} in TZ i.e. u ^ = v. 

Assume that we do have a non-trivial idempotent v (z TZ. Let TZv be the sub-algebra of TZ gen- 
erated by multiplying elements of TZ with v. Since any a = av + a(l — v) and TZv n TZ{1 — v) = {0}, 
we get TZ = TZv(BTZ{l — v) as a non-trivial decomposition of TZ. (Note that 7^ is a direct sum of the 
two sub-algebras because for any a G TZv and b € TZ{1 — v), a ■ b = 0. This is the place where we 
use commutativity of TZ.) By repeating the splitting process on the sub-algebras we can eventually 
prove the theorem. We now show how to find an idempotent from the zero-divisor z. 

An element a (zTZ can be expressed equivalently as a matrix in Mfc(F), where k = dimf(TZ), by 
treating a as the linear transformation on TZ that takes 6 E 7^ to a • 6. Therefore, z is a zero-divisor 
if and only if z as a matrix is singular. Consider the Jordan normal form of z. Since it is merely 
a change of basis we would assume, without loss of generality, that z is already in Jordan normal 
form. (We won't compute the Jordan normal form in our algorithm, it is used only for the sake of 
argument.) Let, 

_ ' A " 

^~ [ N _ 

where A, N are block diagonal matrices and A is non-singular and N is nilpotent. Therefore there 
exits a positive integer t < k such that, 

t \ B 0' 
= [ 0_ 

where B = A^ is non-singular. The claim is, there is an identity element in the sub-algebra TZw 
which can be taken to be the idempotent that splits TZ. To see this first observe that the minimum 
polynomial of w over F is m{x) = x ■ m'{x), where m'{x) is the minimum polynomial of B. Also if 
m(x) = X^iLi Q^i^* then ai 7^ as it is the constant term of m'{x) and B is non-singular. Therefore, 
there exists an a G 7^ such that w ■ (aw — 1) = 0. We can take v = aw as the identity element in 
the sub-algebra TZw. This v {0, 1} is the required idempotent in TZ. □ 

We are now ready to prove Theorem 11.61 

Theorem 11.61 (restated.) Given an expression, 

d 

P = Y[ {Aio + Aiixi + ... + A 

i=l 

where Aij ^ TZ, a commutative algebra of constant dimension over ¥ that is given in basis form, 
there is a deterministic polynomial time algorithm to test if P is zero. 
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Proof. Suppose, the elements ei, . . . , form a basis of TZ over F. Since any element in TZ can be 
equivalently expressed as a k x k matrix over F (by treating it as a linear transformation), we will 
assume that Aij G l\/lfc(F), for all i and j. Further, since TZ is given in basis form, we can find these 
matrix representations of Aij^s efficiently. 

If every Aij is non-singular, then surely P ^ 0. (This can be argued by fixing an ordering 
xi y X2 >~ ■ ■ ■ y Xn among the variables. The coefficient of the leading monomial of P, with 
respect to this ordering, is a product of invertible matrices and hence P ^ 0.) Therefore, assume 
that ^Aij = z such that z is a zero-divisor i.e. singular. From the proof of Theorem 13.21 it follows 
that there exists a t < k such that the sub- algebra TZw, where w = z^, contains an identity element 
V which is an idempotent. To find the right w we can simply go through all 1 < t < k. We now 
argue that for the correct choice of w, v can be found by solving a system of linear equations over 
F. Let 6i, . . . , be a basis of TZw, which we can find easily from the elements eiw, . . . , e^w. In 
order to solve for v write it as, 

V = i^ibi + . . . + i^k'bk' 

where Uj G F are unknowns. Since v is an identity in TZw we have the following equations, 

(1^161 -|- . . . -|- Uk'bk') ■ bi = bi for 1 <i <k' . 

Expressing each bi in terms of ei, . . . , e^, we get a set of linear equations in Vj^s. Thus for the right 
choice of w (i.e. for the right choice of t) there is a solution for v. On the other hand, a solution 
for V for any w gives us an idempotent, which is all that we need. 

Since TZ = TZv®TZ{l—v) we can now split the identity testing problem into two similar problems, 
i.e. P is zero if and only if, 

d 

-Pt" = (^io^ + Aiiv ■ xi + ... + AinV ■ Xn) and 

i=l 
d 

P{l-V) = ]\{AiQ{l-v) + Aii{l-v)-Xi + ...+Ain{l-v)-Xn) 

i=l 

are both zero. What we just did with P € 7^ we can repeat for Pv € TZv and P(l — € TZ{1 — v). 
By decomposing the algebra each time an Aij is a non-nilpotent zero-divisor, we have reduced the 
problem to the following easier problem of checking if 

d 

P = n (^»0 + + . . . + AinXn) 

1=1 

is zero, where the coefficients Aij^s are either nilpotent or invertible matrices. 

Let Tj = {AiQ + Aiixi + . . . + AinXn) be a term such that the coefficient of Xj in T,, i.e. Aij 
is invertible. And suppose Q be the product of all terms other than Tj. Then P = Ti ■ Q (since 
TZ is commutative). Fix an ordering among the variables so that Xj gets the highest priority. The 
leading coefficient of P, under this ordering, is Aij times the leading coefficient of Q. Since Aij 
is invertible this implies that P = if and only if Q = 0. (If AiQ is invertible, we can arrive at 
the same conclusion by arguing with the coefficients of the least monomials of P and Q under 
some ordering.) In other words, P = if and only if the product of all those terms for which 
all the coefficients are nilpotent matrices is zero. But this is easy to check since the dimension of 
the algebra. A; is a constant. (In fact, this is the only step where we use that A: is a constant.) If 
number of such terms is greater than k then P is automatically zero (this follows easily from the 
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fact that the commuting nilpotent matrices can be simultaneously triangularized with zeroes in 
the diagonal). Otherwise, simply multiply those terms and check if it is zero. This takes 0{n!^) 
operations over F. □ 

It is clear from the above discussion that identity testing of depth 2 (IIS) circuits over commutative 
algebras reduces in polynomial time to that over local rings. As long as the dimensions of these 
local rings are constant we are through. But what happens for nonconstant dimensions? The 
following result justifies the hardness of this problem. 

Theorem 11.71 (restated.) Given a depth 3 (SIIS) circuit C of degree d and top level fan-in s, it 
is possible to construct in polynomial time a depth 2 (IIS) circuit C over a local ring of dimension 
s{d — 1) + 2 over F such that C computes a zero polynomial if and only if C does so. 

Proof. The proof is relatively straightforward. Consider a depth 3 (SIIS) circuit computing a poly- 
nomial / = Yli=i TYj=i kj, where /jj's are linear functions. Consider the ring TZ = ¥[yi, . . . , ys]/2r, 
where I is an ideal generated by the elements {yiyj}i<i<j<s and {yf — yf }i<j<s- Observe that TZis a 
local ring, as yf~^^ = for all 1 < i < s. Also the elements {1, yi, . . . , yf, y2, . . . , 2/2"^' ■ ■ ■ ^Vs, ■ ■ ■ , yf"^} 
form an F-basis of TZ. Now notice that the polynomial, 

d 

^ = n + • • • + J-jsys) 
= f-yf 

is zero if and only if / is zero. Polynomial P can indeed be computed by a depth 2 (IIS) circuit 
over TZ. □ 



4 Weakness of the depth 2 model 

In Lemma [2. 11 we saw that the depth 2 circuit over U2(F) computes L ■ f instead of /. Is it possible 
to drop the factor L and simply compute /? In this section, we show that in many cases it is 
impossible to find a depth 2 circuit over U2(F) that computes /. 

4.1 Depth 2 model over U2(F) 

We will now prove Theorem ll.8i In the following discussion we use the notation {hjh) to mean 
the ideal generated by two linear functions h and I2. Further, we say that h is independent of I2 

if 1 0(/l,/2). 



Theorem 11.81 (restated.) Let f G F[xi, . . . , Xn] be a polynomial such that there are no two linear 
functions li and I2 (with 1 [h^h)) which make f mod (hjh) Silso a linear function. Then f is 
not computable by a depth 2 circuit over U2(F). 

Proof. Assume on the contrary that / can be computed by a depth 2 circuit over U2(F). In other 
words, there is a product sequence Mi • • • Mt of 2 x 2 upper-triangular linear matrices such that / 

kl k2 



appears as the top-right entry of the final product. Let Mj 



i3 



then 



/= [ 1 



^11 h2 




hi h2 




kl k2 




" " 


h3 . 




h3 . 




Its . 




1 



(4) 
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Case 1: Not all the liis are constants. 

Let k be the least index such that l^i is not a constant and In 
Equation m let 



Ci for all i < k. To simplify 



B 
L 



[ di D, 



Mk+i ■■■Mt 
[1 ] • Ml • • • Mi_i 




1 



Observe that L is just a product of linear functions, and for all 1 < i < A;, we have the following 
relations. 



di+i 



dili2 + lizDi 



Hence, Equation |4] simplifies as 

/ 



hi h2 




' B ' 






L 



[ dk Dk 
dkhiB + {dkh2 + hsDk) L 



Suppose there is some factor / of L with 1 ^ {lki,l)- Then / = mod (hi, I), which is not 
possible. Hence, L must be a constant modulo hi- For appropriate constants a, (3, we have 



/ = ah2 + PhsDk (mod ki) 



(5) 



We argue that the above equation cannot be true by inducting on k. If 1^3 was independent of 
hi, then / = ah2 mod (hi^hs) which is not possible. Therefore, must be a constant modulo 
hi- We then have the following (reusing a and /? to denote appropriate constants): 

/ = ah2 + (3Dk (mod /fei) 

= ah2 + l3 {dk^il{k-i)2 + kk-i)3Dk-i) (mod hi) 
= (a^fc2 + /54-i^(fc-i)2) + /5^(fc-i)3^A:-i (mod /fci) 

The last equation can be rewritten in the form of Equation [5] with piksDk replaced by j3l(^k-i)zDk-i- 
Notice that the expression {Q.h2 + I3dk^il[k~i)2) is linear just like ah2- Hence by using the argu- 
ment iteratively we eventually get a contradiction at Di. 

Case 2: All the /ji's are constants. 

In this case. Equation |4] can be rewritten as 



/ 



[dt Dt] 
dtk2 + ksDt 



Ct lt2 




" " 


Its . 




1 



The last equation is again of the form in Equation [5] (without the mod term) and hence the same 
argument can be repeated here as well to give the desired contradiction. □ 
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The following corollary provides some explicit examples of functions that cannot be computed. 

Corollary 4.1. A depth 2 circuit over U2(F) cannot compute the polynomial xiX2 + X3X4 + x^xq. 
Other examples include well known functions like detn and perm„, the determinant and permanent 
polynomials, for n >3. 

Proof. It suffices to show that / = X1X2 + X3X4 + xr,XQ satisfy the requirement in Theorem 1 1.8i 

To obtain a contradiction, let us assume that there does exist two linear functions li and I2 
(with 1 (^1, 12)) such that / mod (^i, I2) is linear. We can evaluate / mod {li, I2) by substituting a 
pair of the variables in / by linear functions in the rest of the variables (as dictated by the equations 
h = h = 0). By the symmetry of /, we can assume that the pair is either {xi,X2} or {xijX^}. 

If xi = l[ and X3 = I2 are the substitutions, then l'^X2 + I2X4 can never contribute a term to 
cancel off x^xq and hence / mod (hjh) cannot be linear. 

Otherwise, let xi = I'l and X2 = I2 be the substitutions. If / mod {li, I2) = l'il'2 + 2^3X4 + x^x% 
is linear, there cannot be a common Xi with non-zero coefficient in both and /g- Without loss of 
generality, assume that involves 2:3 and X5 and I2 involves 3:4 and xq. But then the product /^?2 
would involve terms like X2,xq that cannot be cancelled, contradicting linearity again. □ 



4.2 Depth 2 model over M2(F) 



In this section we show that the power of depth 2 circuits is very restrictive even if we take the 
underlying algebra to be M2(F) instead of U2(F). In the following discussion, we will refer to a 
homogeneous linear function linear form. 

Definition 4.2. A polynomial f of degree n is said to be r-robust if f does not belong to any ideal 
generated by r linear forms. 

For instance, it can be checked that det„ and perm„, the symbolic determinant and permanent of 
an n X n matrix, are (n — l)-robust polynomials. For any polynomial /, we will denote the d^^ 
homogeneous part of / by [f]d. And let {hi, • • • , hk) denote the ideal generated by /ii, • • • , hk- For 
the following theorem recall the definition of degree restriction (Definition II. 9p given in Section [TJ 

Theorem 4.3. A polynomial f of degree n, such that [/]„ is 5-robust, cannot be computed by a 
depth 2 circuit over M2(F) under a degree restriction of n. 

We prove this with the help of the following lemma, which basically applies Gaussian column 
operations to simplify matrices. 

Lemma 4.4. Let fi be a polynomial of degree n such that [fi]n is A-robust. Suppose there is a 
linear matrix M and polynomials /2, 51,92 of degree at most n satisfying 



fi 

f2 



M 



91 

92 



Then, there is an appropriate invertible column operation A such that 

M -A- 



1 h2 

C3 hi + C4 



where C3,C4 are constants and /i2,/i4 are linear forms. 

We will defer the proof of this lemma to the end of this section, and shall use it to prove Theorem l4.31 
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Proof of Theorem \4-3\ Assume, on the contrary, that we do have such a sequence of matrices 
computing /. Since only one entry is of interest to us, we shall assume that the first matrix is a 
row vector and the last matrix is a column vector. Let the sequence of minimum length computing 
/ be the following: 

f = v ■ M1M2 ■■■Md-w 

Using Lemma 14.41 we shall repeatedly transform the above sequence by replacing MjMj+i by 
(MiA){A~^ Mi-^^i) for an appropriate invertible column transformation A. Since A would consist of 
just constant entries, MiA and A~^Mi^i continue to be linear matrices. 



To begin, let v = [/i, ^2] for two linear functions li and l2- And let [/i, f2]^ 
we have. 



" / " 




' h h' 




" /l " 












. h _ 



Ml--- MdW. Then 



Hence, by Lemma 14.41 we can assume v = [1, /i] and hence / = /i + /i/2- By the minimality of the 
sequence, /i / 0. This forces fi to be 4-robust and the degree restriction makes [f2\n = 0. 

Let [51,52]^ = M2 - ■ - MdW. The goal is to translate the properties that [fi]n is 4-robust and 
[/2]n = to the polynomials gi and §2- Translating these properties would show each Mj is of the 
form described in Lemma 14.41 Thus, inducting on the length of the sequence, we would arrive at 
the required contradiction. In general, we have an equation of the form 



/i 

/2 



Mi 



91 

92 



Since [fi]n is 4-robust, using Lemma 14.41 again, we can assume that 



/l ■ 




" 1 h2 ' 




91 


h . 




C3 C4 + 




. 92 _ 



(6) 



by reusing the variables 51, §2 and others. Observe that in the above equation if /14 = then 
Mi-iMi still continues to be a linear matrix (since, by induction, Mj_i is of the form as dictated 
by Lemma l4.4p and that would contradict the minimality of the sequence. Therefore /14 7^ 0. 



Claim: C3 = (by comparing the v}^ homogeneous parts of fi and gi , as explained below) . 
Proof: As /14 7^ 0, the degree restriction forces deg5'2 < And since deg/2 < n, we have the 
relation C3[(7i]„ = — /i4[52]n-i- If C3 7^ 0, we have [gi\n € (/14), contradicting robustness of as 
then [fi]n = [gi]n + /l2[fi'2]n-l G ih2,hi). □ 

Therefore Equation [6] gives. 



fl ■ 




" 1 h2 ' 




91 


h . 




C4 + /l4 




. 92 _ 



with /14 7^ 0. Also, since [/2]ra+i = [/2]n = this implies that [g2\n = b2]n-i = 0. Hence, 
[9i\n = [fi\n is 4-robust. This argument can be extended now to gi and g2- Notice that the degree 
of gi remains n. However, since there are only finitely many matrices in the sequence, there must 
come a point when this degree drops below n. At this point we get a contradiction as [gi\n = 
(reusing symbol) which contradicts robustness. □ 

We only need to finish the proof of Lemma 14.41 
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Proof of Lemma \4-4\ Suppose we have an equation of the form 



/l ■ 




hi + Cl /l2 + C2 




gi 


h . 




. ^3 + C3 /i4 + C4 




. 92 _ 



(7) 



On comparing degree n + 1 terms, we have 

hi[gi]n + h2[g2]n = 
halgiln + h4[g2]n = 

If /13 and /i4 (a similar reasoning holds for hi and /12) were not proportional (i.e. not multiple of 
each other), then the above equation would imply [gi]n, [g2]n £ (^3,^4)- Then, 

[fl]n = hi[gi]n^i + /l2[52]n-l + Cl[gi]n + C2[g2]n G (hi, h2, h^, h^) 

contradicting the robustness of [fi]n- Thus, /13 and /i4 (as well as hi and /12) are proportional, in 
the same ratio as [ — g2]n snd [fl'ijn' Using an appropriate column operation, Equation Ffl simplifies 
to 



/l ■ 




Cl /l2 + C2 




51 


h . 




C3 /l4 + C4 




. 52 _ 



If Cl = 0, then together with [g2]n = we get [fi]n = h2[g2]n-i contradicting robustness. Therefore 
Cl ^ and another column transformation would get it to the form claimed. □ 

5 Concluding remarks 

We give a new perspective to identity testing of depth 3 arithmetic circuits by showing an equiva- 
lence to identity testing of depth 2 circuits over U2(F). The reduction implies that identity testing 
of a width-2 algebraic branching program is at least as hard as identity testing of depth 3 circuits. 

We also give a deterministic polynomial time identity testing algorithm for depth 2 circuits over 
any constant dimensional commutative algebra. Our algorithm crucially exploits an interesting 
structural result involving local rings. This naturally poses the following question - Can we use 
more algebraic insight on non-commutative algebras to solve the general problem? The solution for 
the commutative case does not seem to give any interesting insight into the non-commutative case. 
But we have a very specific non-commutative case at hand. The question is - Is it possible to use 
properties very specific to the ring of 2 x 2 matrices to solve identity testing for depth 3 circuits? 
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A Appendix 

For the sake of completeness, we provide a proof of the result by Ben-Or and Cleve |BC88] . 

Theorem A.l. WC88^ Let E he an arithmetic formula of depth d with fan-in (of every gate) 
bounded by 2. Then, there exists a sequence of 3x3 matrices, whose entries are either variables or 
constants, of length at most 4*^ such that one of the entries of their product is E. 

Proof. The proof is by induction on the structure of E. The base case when E = c - Xi is computed 
as, 

1 

1 

€■ Xi 1 

Suppose E = fl + f2 and that we have inductively constructed sequences computing /i and /2. 
Then the following equation gives a sequence for E. 



1 




1 




1 


1 




1 




1 


. /l 1 . 




.h 1 . 




/l + /2 1 



li E = fl • /2 , then the following sequence computes E 



1 




" 1 




1 




" 1 




1 


-/2 1 




1 




/2 1 




1 




1 


1 




/l 1 . 




1 _ 




-/l 1 . 




/l/2 1 



Applying the above two equations inductively, it is clear that E can be computed by a sequence of 
length at most 4"^. □ 
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